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DNA RECOMBINATION IN EUKARYOTIC CELLS BY THE 
BACTERIOPHAGE PHIC31 RECOMBINATION SYSTEM 

5 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 

This invention was made with government support under Grant No. 5335- 
21000-009-06S, awarded by the United States Department of Agriculture, Agricultural 
Research Service. The Government has certain rights in the invention. 

10 CROSS-REFERENCE TO RELATED APPLICATION 

This appUcation claims the benefit of US Provisional Application No. 
60/145,469, filed July 23, 1 999, which application is incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

15 This invention pertains to the field of methods for obtaining specific and 

stable integration of nucleic acids into chromosomes of eukaryotes. The invention makes use 
of site-specific recombination systems that use prokaryotic recombinase polypeptides, such 
as the OC3 1 integrase. 

Background 

20 Genetic transformation of eukaryotes often suffers fix)m significant 

shortcomings. For example, it is often difiBcult to reproducibly obtain integration of a 
transgene at a particular locus of interest. Homologous recombination generally occurs only 
at a very low fi*equency. To overcome this problem, site-specific recombination systems 
have been employed. These methods involve the use of site-specific recombination systems 

25 that can operate in higher eucaryotic cells. 

Many bacteriophage and integrative plasmids encode site-specific 
recombination systems that enable the stable incorporation, of their genome into those of 
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their hosts. In these systems, the minimal requirements for the recombination reaction are a 
recombinase enzyme, or integrase, which catalyzes the recombination event, and two 
recombination sites (Sadowski (1986)7. Bacteriol 165: 341-347; Sadowski (1993) F^SE^ 
J. 7: 760-767). For phage integration systems, these are referred to as attachment {att) sites, 
5 with an attP element from phage DNA and the attB element encoded by the bacterial 

genome. The two attachment sites can share as little sequence identity as a few base pairs. 
The recombinase protein binds to both att sites and catalyzes a conservative and reciprocal 
exchange of DNA strands that result in integration of the circular phage or plasmid DNA 
into host DNA. Additional phage or host factors, such as the DNA bending protein IHF, 
10 integration host /actor, may be required for an efficient reaction (Friedman (1 988) Cell 

55:545-554; Finkel & Johnson (1992) Mol Microbiol 6: 3257-3265). The reverse excision 
reaction sometimes requires an additional phage factor, such as the xis gene product of phage 
;i (Weisberg & Landy (1983) "Site-specific recombination in phage lambda." In Lambda II, 
eds. Hendrix et al (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) pp.2 1 1 -250; 
15 Landy (1989) Ann. Rev. Biochem, 58: 913-949. 

The recombinases have been categorized into two groups, the A. integrase 
(Argos et al (1986) EMBO J. 5: 433-44; Voziyanov et al (1999) Nucl Acids Res, 11: 930- 
941) and the resolvase/invertase (HatfuU & Grindley ( 1 988) 'Hesolvases and DNA- 
invertases: a family of enzymes active in site-specific recombination** In Genetic 
20 Recombination, eds. Kucherlipati, R., & Smith, G. R. (Am. Soc. Microbiol., Washington 
DC), pp. 357-396) families. These vary in the structure of the integrase enzymes and the 
molecular details of their mode of catalysis (Stark et al (1992) Trends Genetics 8: 432-439). 
The temperate Streptomyces phage <DC31 encodes a 68 kD recombinase of the latter class. 
The efiBcacy of the OC3 1 integrase enzyme in recombining its cognate attachment sites was 
25 recently demonstrated in vitro and in vivo in recA mutant Escherichia coli (Thorpe & Smith 
(1998) Proc, Nat'l Acad, Scl USA 95: 5505-5510). The <I)C31 integration reaction is simple 
in that it does not require a host factor and appears irreversible, most likely because an 
additional phage protein is required for excision. The phage and bacterial att sites share only 
three base pairs of homology at the point of cross-over. This homology is flanked by 
30 inverted repeats, presumably binding sites for the integrase protein. The minimal known 
functional size for both attB and attP is --50 bp. 
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The Cre-/ox system of bacteriophage PI, and the FLP-FRT system of 
Saccharomyces cerevisiae have been widely used for transgene and chromosome 
engineering in animals and plants (reviewed by Sauer (1994) Curr, Opin. BiotechnoL 5: 521- 
527; Ow (1996) Curr, Opin. BiotechnoL 7: 181-186). Other systems that operate in animal 

5 or plant cells include the following: 1) the R-i?5 system from Zygosaccharomyces rouxii 
(Onouchi et al (1995) Mol Gen, Genet. 247: 653-660), 2) the Gin-gix system from 
bacteriophage Mu (Maeser & Kahmann (1991) Mol Gen, Genet, 230: 170-176) and, 3) the p 
recombinase-jix system from bacterial plasmid pSM19035 (Diaz et al, (1999) J, Biol Chem, 
274: 6634-6640). By using the site-specific recombinases, one can obtain a greater frequency 

10 of integration. 

However, these five systems suffer from a significant shortcoming. Each of 
these systems have in common the property that a single polypeptide recombinase catalyzes 
the recombination between two sites of identical or nearly identical sequences. The product- 
sites generated by recombination are themselves substrates for subsequent recombination. 
1 5 Consequently, recombination reactions are readily reversible. Since the kinetics of 

intramolecxilar interactions are favored over mtermolecular interactions, these recombination 
systems are efiBcient for deleting rather than integrating DNA. Thus, a need exists for 
methods and systems for obtaining stable site-specific integration of transgenes. The present 
invention fiilfills this and other needs. 

20 SUMIVIARYOFTHEBVVENTION 

The present mvention provides methods for obtaining stable, site-specific 
recombination in a eukaiyotic cell. Unlike previously known methods for site-specific 
recombination, the recombinants obtained using the methods of the invention are stable. The 
recombination reaction is not reversible. 

25 The methods involve providing a eukaryotic cell that comprises a first 

recombination site and a second recombination site, which second recombination site can 
serve as a substrate for recombination with tiie first recombination site. The first and the 
second recombination sites are contacted with a prokaryotic recombinase polypeptide, 
resulting in recombination between the recombination sites, thereby forming one or two 

30 hybrid recombination sites. Significantly, the recombinase polypeptide is one that can 
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mediate site-specific recombination between the first and second recombination sites, but 
cannot mediate recombination between the two hybrid recombination sites in the absence of 
an additional phage-produced factor that is not present in the eukaryotic cell. Either or both 
of the recombination sites can be present in a chromosome of the eukaryotic cell. In some 

5 embodiments, one of the recombination sites is present in the chromosome and the other is 
included within a nucleic acid that is to be integrated into the chromosome. 

The invention also provides eukaryotic cells that contain a prokaryotic 
recombinase polypeptide or a nucleic acid that encodes a prokaryotic recombinase. In these 
embodiments, the recombinase is one that can mediate site-specific recombination between a 

10 first recombination site and a second recombination site that can serve as a substrate for 
recombination with the first recombination site, but in the absence of an additional factor 
that is not present in the eukaryotic cell cannot mediate recombination between two hybrid 
recombination sites that are formed upon recombination between the first recombination site 
and the second recombination site. In presently preferred embodiments, the cells of the 

1 5 invention include a nucleic acid that has a coding sequence for a recombinase polypeptide. 
The recombinase coding sequence is preferably operably linked to a promoter that mediates 
expression of the recombinase-encoding polynucleotide in the eukaryotic cell. The 
eukaryotic cells of the invention can be an animal cell, a plant cell, a yeast cell or a fimgal 
cell, for example. 

20 In additional embodiments, the invention provides methods for obtaining a 

eukaryotic cell having a stably integrated transgene. These methods involve introducing a 
nucleic acid into a eukaryotic cell that comprises a first recombination site, wherein the 
nucleic acid comprises the transgene of interest and a second recombination site which can 
serve as a substrate for recombination with the first recombination site. The first and second 

25 recombination sites are contacted with a prokaryotic recombinase polypeptide. The 
recombinase polypeptide catalyzes recombination between the first and second 
recombination sites, resulting in integration of the nucleic acid at the first recombination site, 
thereby forming a hybrid recombination site at each end of the nucleic acid. Again, the 
recombinase polypeptide is one that can mediate site-specific recombination between the 

30 first and second recombination sites, but cannot mediate recombination between two hybrid 
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recombination sites in the absence of an additional factor that is not present in the eukaryotic 
cell. 

Additional embodiments of the invention provide nucleic acids that include a 
polynucleotide sequence that encodes a bacterial recombinase polypeptide operably linked to 

5 a promoter that functions in a eukaryotic cell. The recombinase polypeptides encoded by 
these nucleic acids of the invention cannot mediate recombination between two hybrid 
recombination sites that are formed upon recombination between a first recombination site 
and a second recombination site in the absence of a bacteriophage factor that is not present in 
the eukaryotic cells, hi some embodiments, the nucleic acids further include at least one 

10 recombination site that is recognized by the recombinase polypeptide. 

Also provided by the invention are eukaryotic cells that include a 
polynucleotide that has one or more bacteriophage <DC3 1 recombination sites, or 
recombination sites for other recombinases that cannot mediate recombination between two 
hybrid recombination sites that are formed upon recombination between a first 

1 5 recombination site and a second recombination site in the absence of a bacteriophage factor 
that is not present in the eukaryotic cells. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic (not to scale) representation of the chromosome 
structure at the S, pombe leul locus. Homologous insertion of pLT44 into the chromosome 

20 (Figure 1 A) places a OC3 1 attP target between leul alleles as shown in Figure IB. pLT43 
promoted site-specific integration of pLT45 into the chromosomal attP target leads to the 
structure shown in Figure IC. Arrowheads indicate PGR primers corresponding to the T7 
promoter (T7), T3 promoter (T3) and ura4^ coding region (U4). Predicted sizes ofXbql (X) 
cleavage products are shown. 

25 Figure 2 shows a schematic of an experiment which demonstrated that OC3 1 

integrase catalyzes site-specific integration of a transgene encoding green fluorescent protein 
(GFP)in CHO cells. 

Figure 3 shows a schematic diagram of an experiment which demonstrated 
that <DC31 catalyzes specific recombination at an attB site to insert a hygromycin 

30 phosphotransferase gene downstream of a chromosomally located promoter. Successfiil 
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integration produces a Pc-attL-hpt linkage and a hygromycin resistance phenotype. The 
effect of different lengths of attP and attB sites were analyzed using the plasmids indicated. 

Figure 4 shows a schematic diagram of an experiment which demonstrates 
that (DC31 integrase catalyzes the excision of a DNA flanked by attB and attP sites from the 
5 tobacco genome. 

Figure 5 shows a schematic diagram of an experiment in which <I>C31 
integrase was shown to catalyze integration of a transgene into the tobacco genome. 

DETAILED DESCRIPTION 

Definitions 

10 An "exogenous DNA segment", "heterologous polynucleotide" a 'transgene" 

or a "heterologous nucleic acid", as used herein, is one that originates from a source foreign 
to the particular host cell, or, if from the same source, is modified from its original form. 
Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular 
host cell, but has been modified. Thus, the terms refer to a DNA segment which is foreign or 

15 heterologous to the cell, or homologous to the cell but in a position within the host cell 
nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are. 
expressed to yield exogenous polypeptides. 

The term "gene" is used broadly to refer to any segment of DNA associated 
with a biological function. Thus, genes include coding sequences and/or the regulatory 

20 sequences required for their expression. Genes can also include nonexpressed DNA 

segments that, for example, form recognition sequences for other proteins. Genes can be 
obtained from a variety of sources, including cloning from a source of interest or 
synthesizing from known or predicted sequence information, and may include sequences 
designed to have desired parameters. 

25 The term "isolated", when applied to a nucleic acid or protein, denotes that 

the nucleic acid or protein is essentially free of other cellular components with which it is 
associated in the natural state. It is preferably in a homogeneous state although it can be in 
either a dry or aqueous solution. Purity and homogeneity are typically determined using 
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 

30 perfomiance liquid chromatography. A protein which is the predominant species present in a 
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prq)aration is substantially purified. In particular, an isolated gene is separated from open 
reading frames which flank the gene and encode a protein other than the gene of interest. 
The term "purified" denotes that a nucleic acid or protein gives rise to essentially one band 
in an electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 
5 about 50% pure, more preferably at least about 85% pure, and most preferably at least about 
99% pure. 

The term **naturally-occurring" is used to describe an object that can be found 
in nature as distinct from being artificially produced by man. For example, a polypeptide or 
polynucleotide sequence that is present in an organism (including viruses) that can be 

10 isolated from a source in nature and which has not been intentionally modified by man in the 
laboratory is naturally-occiming. 

The term "nucleic acid" or "polynucleotide" refers to deoxyribonucleotides 
or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless 
specifically limited, the term encompasses nucleic acids containing known analogues of 

15 natural nucleotides which have similar binding properties as the reference nucleic acid and 
X are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise 
indicated, a particular nucleic acid sequence also implicitiy encompasses conservatively 
modified variants thereof (eg. degenerate codon substitutions) and complementary 
sequences and as well as the sequence expUcitly indicated. Specifically, degenerate codon 

20 substitutions may be achieved by generating sequences in which the third position of one or 
more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues 
(Batzer et al (1991) Nucleic Acid Res. 19: 5081; Ohtsuka et al (1985) J. Biol Chem. 260: 
2605-2608; Cassol et al. (1992) ; Rossolini et al (1994) Mol Cell Probes 8: 91-98). The 
temi nucleic acid is used interchangeably with gene, cDNA, and mRNA encoded by a gene. 

25 *'Nucleic acid derived fi?om a gene" refers to a nucleic acid for whose 

synthesis a gene, or a subsequence thereof (e.g., coding region), has ultimately served as a 
template. Thus, an mRNA, a cDNA reverse transcribed from an mRNA, an RNA transcribed 
fit)m that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the amplified 
DNA, etc, are all derived from the gene and detection of such derived products is indicative 

30 of the presence and/or abundance of the original. 
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A DNA segment is "operably linked" when placed into a functional 
relationship with another DNA segment. For example, DNA for a signal sequence is 
operably linked to DNA encoding a polypeptide if it is expressed as a preprotein that 
participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to 

S a coding sequence if it stimulates the transcription of the sequence. Generally, DNA 

sequences that are operably linked are contiguous, and in the case of a signal sequence both 
contiguous and in reading phase. However, enhancers, for example, need not be contiguous 
with the coding sequences whose transcription they control. Linking is accomplished by 
ligation at convenient restriction sites or at adapters or linkers inserted in lieu thereof. 

1 0 *Tlant" includes whole plants, plant organs (e.g., leaves, stems, roots, etc.), 

seeds and plant cells and progeny of same. The class of plants that can be used in the 
methods of the invention is generally as broad as the class of higher plants amenable to 
transformation techniques, including both monocotyledonous and dicotyledonous plants. 
'^Promoter" refers to a region of DNA involved in binding the RNA 

15 polymerase to initiate transcription. An "inducible promoter*' refers to a promoter that directs 
expression of a gene where the level of expression is alterable by environmental or 
developmental factors such as, for example, temperature, pH, transcription factors and 
chemicals. 

The term "recombinant" when used with reference to a cell indicates that the 
20 cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a 
heterologous nucleic acid. Recombinant cells can contain polynucleotides that are not found 
within the native (non-recombinant) form of the cell. Recombinant cells can also contain 
polynucleotides found in the native form of the cell wherein the polynucleotides are 
modified and re-introduced into the cell by artificial means. The term also encompasses cells 
25 that contain a nucleic acid endogenous to the cell that has been modified without removing 
the nucleic acid from the cell; such modifications include those obtained by gene 
replacement, site-specific mutation, and related techniques. 

A "recombinant expression cassette" or simply an "expression cassette" is a 
nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements 
30 that are capable of eflfecting expression of a structural gene in hosts compatible with such 
sequences. Expression cassettes include at least promoters and optionally, transcription 
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tennination signals. Typically, the recombinant expression cassette includes a nucleic acid 

to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a promoter. 

Additional factors necessary or helpful in effecting expression may also be used as described 

herein. For example, an expression cassette can also include nucleotide sequences that 
5 encode a signal sequence that directs secretion of an expressed protein from the host cell. 

Transcription termination signals, enhancers, and other nucleic acid sequences that influence 

gene expression, can also be included in an expression cassette. 

"Recombinase" refers to an enzyme that catalyzes recombination between 

two or more recombination sites. Recombinases useful in the present invention catalyze 
10 recombination at specific recombination sites which are specific polynucleotide sequences 

that are recognized by a particular recombinase. The term "integrase" refers to a type of 

recombinase. 

'Transformation rate" refers to the percent of cells that successfully 

incorporate a heterologous polynucleotide into its genome and survive. 

1 5 The term 'transgenic" refers to a cell that mcludes a specific modification that 

was introduced into the cell, or into an ancestor of the cell. Such modifications can include 
one or more point mutations, deletions, insertions, or combiixations thereof. When referring 
to an animal, the temi ^transgenic" means that the animal includes cells that are transgenic. 
An animal that is composed of both transgenic and non-transgenic cells is referred to herein 

20 as a "chimeric" animal. 

The terai "Vector" refers to a composition for transferring a nucleic acid (or 
nucleic acids) to a host cell. A vector comprises a nucleic acid encoding the nucleic acid to 
be transferred, and optionally comprises a viral capsid or other materials for facilitating entry 
of the nucleic acid into the host cell and/or replication of the vector in the host cell (e.g., 

25 reverse transcriptase or other enzymes which are packaged within the c^sid, or as part of 
the capsid). 

•^Recombination sites" are specific polynucleotide sequences that are 
recognized by the recombinase enzymes described herein. Typically, two different sites are 
mvolved (termed "complementary sites"), one present in the target nucleic acid (e.g., a 
30 chromosome or episome of a eukaryote) and another on the nucleic acid that is to be 
integrated at the target recombination site. The tenns "a/r5" and "aftP," which refer to 
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attachment (or recombination) sites originally from a bacterial target and a phage donor, 
respectively, are used herein although recombination sites for particular enzymes may have 
different names. The recombination sites typically include left and right arms separated by a 
core or spacer region. Thus, an attB recombination site consists of BOB', where B and B' are 

5 the left and right arms, respectively, and.O is the core region. Similarly, attP is POP', where 
P and P' are the arms and O is again the core region. Upon recombination between the attB 
and attP sites, and concomitant integration of a nucleic acid at the target, the recombination 
sites that flank the integrated DNA are referred to as "a/^L" and "a/ri?." The attL and attR 
sites, using the terminology above, thus consist of BOP' and POB', respectively. In some 

1 0 representations herein, the "0" is omitted and attB and attP, for example, are designated as 
BB' and PP', respectively. 

Description of the Preferred Embodiments 

The present invention provides methods for obtaining site-specific 
recombination in eukaryotic cells. Unlike previously known systems for obtaining site- 

1 5 specific recombination, the products of the recombinations performed using the methods of 
the invention are stable. Thus, one can use the methods to, for example, introduce transgenes 
into chromosomes of eukaryotic cells and avoid the excision of the transgene that often 
occurs using previously known site-specific recombination systems. Stable inversions, 
translocations, and other rearrangements can also be obtained. 

20 The invention employs prokaryotic recombinases, such as bacteriophage 

integrases, that are unidirectional in that they can catalyze recombmation between two 
complementary recombination sites, but cannot catalyze recombination between the hybrid 
sites that are formed by this recombination. One such recombinase, the 003 1 integrase, by 
itself catalyzes only the attB x attP reaction. The integrase cannot mediate recombination 

25 between the attL and attR sites that are formed upon recombination between attB and attP. 
Because recombinases such as the cDC31 integrase cannot alone catalyze the reverse 
reaction, the OC31 attB x attP recombination is stable. This property is one that sets the 
methods of the present invention apart from site-specific recombination systems currently in 
use for eucaryotic cells, such as the Cre-/ojc or FLP-Fi?r system, where the recombination 

30 reactions can readily reverse. Use of the recombination systems of the invention provides 


10 


wo 01/07572 


PCT/USOO/19983 


new opportunities for directing stable transgene and chromosome rearrangements in 
eukaryotic cells. 

The methods involve contacting a pair of recombination sites (e.g., attB and 
attP) that are present in a eukaryotic cell with a corresponding recombinase. The 

5 recombinase then mediates recombination between the recombination sites. Depending upon 
the relative locations of the two recombination sites, any one of anumber of events can 
occur as a result of the recombination. For example, if the two recombination sites are 
present on different nucleic acid molecules, the recombination can result in integration of 
one nucleic acid molecule into a second molecule. Thus, one can obtain integration of a 

10 plasmid that contains one recombination site into a eukaryotic cell chromosome that includes 
the corresponding recombination site. Because the recombinases used in the methods of the 
invention cannot catalyze the reverse reaction, the integration is stable. Such methods are 
useful, for example, for obtaining stable integration into the eukaryotic chromosome of a 
transgene that is present on the plasmid. 

15 The two recombination sites can also be present on the same nucleic acid 

molecule. In such cases, the resulting product typically depends upon the relative orientation 
of the sites. For example, recombination between sites that are in the direct orientation will 
generally result in excision of any DNA that lies between the two recombination sites. In 
contrast, recombination between sites that are in the reverse orientation can result in 

20 inversion of the intervening DNA. Again, the resulting rearranged nucleic acid is stable in 
that the recombination is irreversible in the absence of an additional factor, generally 
encoded by the particular bacteriophage from which the recombinase is derived, that is not 
normally foimd in eukaryotic cells. One example of an ^plication for which this method is 
useful involves the placement of a promoter between the two recombination sites. If the 

25 promoter is initially in the opposite orientation relative to a coding sequence that is to be 
expressed by the promoter and the recombination sites that flank the promoter are in the 
inverted orientation, contacting the recombination sites will result in inversion of the 
promoter, thus placing the promoter in the correct orientation to drive expression of the 
coding sequence. Similarly, if the promoter is initially in the correct orientation for 

30 expression and the recombination sites are in the same orientation, contacting the 
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recombination sites with the promoter can result in excision of the promoter fragment, thus 
stopping expression of the coding sequence. 

The methods of the invention are also useful for obtaining translocations of 
chromosomes, for example. In these embodiments, one recombination site is placed on one 
chromosome and a second recombination site that can serve as a substrate for recombination 
with the first recombination site is placed on a second chromosome. Upon contacting the two 
recombination sites with a recombinase, recombination occurs that results in swapping of the 
two chromosome aims. For example, one can construct two strains of an organism, one 
strain of which includes the first recombination site and the second strain that contains the 
second recombination site. The two strams are then crossed, to obtain a progeny strain that 
includes both of the recombination sites. Upon contacting the sites with the recombinase, 
chromosome arm swapping occurs. 

Recombinases and Recombination Sites 

The methods of the invention use recombinase systems to achieve stable 
integration or other rearrangement of nucleic acids in eukaiyotic cells. A recombinase 
system typically consists of three elements: two specific DNA sequences C*the 
recombination sites") and a specific enzyme C*the recombinase'*). The recombinase catalyzes 
a recombination reaction between the specific recombination sites. 

Recombination sites have an orientation. In other words, they are not 
palindromes. The orientation of the recombination sites in relation to each other determines 
what recombination event takes place. The recombination sites may be in two different 
orientations: parallel (same direction) or opposite. When the recombination sites are present 
on a single nucleic acid molecule and are in a parallel orientation to each other, then the 
recombination event catalyzed by the recombinase is a typically an excision of the 
intervening nucleic acid, leaving a single recombination site. When the recombination sites 
are in the opposite orientation, then any intervening sequence is typically inverted. 

The recombinases used in the methods of the invention can mediate site- 
specific recombination between a first recombination site and a second recombination site 
that can serve as a substrate for recombination with the first recombination site. However, in 
the absence of an additional factor that is not normally present in eukaiyotic cells, cannot 
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mediate recombination between two hybrid recombination sites that are formed upon 
recombination between the first recombination site and the second recombination site: ~" 
Examples of these recombinases include, for example, the bacteriophage OCSl integrase 
{see, e.g., Thorpe & Smith (1998) Proc, Natl Acad, ScL USA 95: 5505-5510; Kuhstoss & 
5 Rao (1991) J. MoL BioL 222: 897-890; US Patent No. 5,190,871), a phage P4 recombinase 
(Ow & Ausubel (1 983) J. BacterioL 1 55 : 704-7 1 3), a Listeria phage recombinase, a 
bacteriophage R4 Sre recombmase (Matsuura et al (1996) J. BacterioL 178: 3374-3376), a 
CisA recombinase (Sato et al (1990) 1 BacterioL 172: 1092-1098; Stragier et aL (1989) 
Science 243: 507-512), an XisF recombinase (Carrasco et aL (1994) Genes Dev. 8: 74-83), 
10 and a transposon ln4451 TnpX recombinase (Bannam et aL (1995) MoL MicrobioL 16: 535- 
551; Grelin & Rood (1997) BacterioL 179: 5148-5156). 

Recombinase polypeptides, and nucleic aci(fe that encode the recombinase 
polypeptides, are described in the art and can be obtained using routine methods. For 
example, a vector that includes a nucleic acid firagment that encodes the OC3 1 integrase is 
1 5 described in US Patent No. 5,1 90,871 and is available firom the Northern Regional Research 
Laboratories, Peoria, Illinois 61 604) under the accession number B-1 8477. 

The recombinases can be introduced into the eukaryotic cells that contain the 
recombination sites at which recombination is desired by any suitable method. For example, 
one can introduce the recombinase in polypeptide form, e.g., by microinjection or other 
20 methods. In presently preferred embodiments, however, a gene that encodes the recombinase 
is mtroduced into the cells. Expression of the gene results in production of the recombinase, 
which then catalyzes recombination among the corresponding recombination sites. One can 
introduce the recombinase gene into the cell before, after, or simultaneously with, the 
introduction of the exogenous polynucleotide of interest. In one embodiment, the 
25 recombinase gene is present within the vector that carries the polynucleotide that is to be 
inserted; the recombinase gene can even be included within the polynucleotide. In other 
embodiments, the recombinase gene is introduced into a transgenic eukaryotic organism, 
e.g., a transgenic plant, animal, fungus, or the like, which is then crossed with an organism 
that contains the corresponding recombination sites. 
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Target Organisms 

The methods of the invention are useful for obtaining stable integration 
and/or rearrangement of DNA in any type of eukaryotic cell. For example, the methods are 
xiseful for cells of animals, plants, fimgi, bacteria and other microorganisms. In some 
embodiments, the cells are part of a multicellular organism, e.g., a transgenic plant or 
animal. The methods of the invention are particularly useful in situations where transgenic 
materials are difficult to obtain, such as with transgenic wheat, com, and animals. In these 
situations, finding the rare single copy insertion reqxiires the prior attainment of a large 
number of independently derived transgenic clones, which itself requires great expenditure 
of effort. 

Among the plant targets of particular interest are monocots, including, for 
example, rice, com, wheat, rye, barley, bananas, patais, lilies, orchids, and sedges. Dicots are 
also suitable targets, including, for example, tobacco, apples, potatoes, beets, carrots, 
willows, elms, maples, roses, buttercups, petunias, phloxes, violets and simflowers. Other 
targets include animal and fungal cells. These fists are merely illustrative and not limiting. 

Constructs for Introduction of Exogenous DNA into Target Cells 

The methods of the invention often involve the introduction of exogenous 
DNA into target cells. For example, nucleic acids that include one or more recombination 
sites are often introduced into the cells. The polynucleotide constmcts that are to be 
introduced into the ceUs can include, in addition to the recombination site or sites, a gene or 
other functional sequence that will confer a desired phenotype on the cell. 

In some embodiments, a polynucleotide construct that encodes a recombinase 
is introduced into the eukaryotic cells in addition to the recombination sites. The 
recombinase-encoding polypeptide can be included on the same nucleic acid as the 
recombination site or sites, or can be introduced mto the cell as a separate nucleic acid. The 
present invention provides nucleic acids that include recombination sites, as well as nucleic 
acids in which a recombinase-encoding polynucleotide sequence is operably linked to a 
promoter that functions in the target eukaryotic cell. 

Generally, a polynucleotide that is to be expressed (e.g., a recombinase- 
encoding polynucleotide or transgene of interest) will be present in an expression cassette, 
meaning that the polynucleotide is operably linked to expression control signals, e.g., 
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promoters and terminators, that are functional in the host cell of interest. The genes that 
encode the recombinase and the selectable marker, will also be under the control of such 
signals that are functional in the host cell. Control of expression is most easily achieved by 
selection of a promoter. The transcription terminator is not generally as critical and a variety 

5 of known elements may be used so long as they are recognized by the cell. 

A promoter can be derived &om a gene that is xmder investigation, or can be a 
heterologous promoter that is obtained fix>m a different gene, or &om a different species. 
Where direct expression of a gene in all tissues of a transgenic plant or other organism is 
desired, one can use a "constitutive" promoter, which is generally active under most 

10 environmental conditions and states of development or cell differentiation. Suitable 

constitutive promoters for use in plants include, for example, the cauliflower mosaic virus 
(CaMV) 35S transcription initiation region and region VI promoters, the 1- or 2 - promoter 
derived from T-DNA of Agrobacterium tumefacienSy and other promoters active in plant 
cells that are known to those of skill in the art. Other suitable promoters include the fuU- 

1 5 iCTgth transcript promoter from Figwort mosaic virus, actin promoters, histone promoters, 
tubulin promoters, or the mannopine synthase promoter (MAS). Other constitutive plant 
promoters include varioxis ubiquitin or polyubiquitin promoters derived from, inter alia^ 
Arabidopsis (Sun and Callis, Plant J,, 1 1(5):1017-1027 (1997)), the mas, Mac or DoubleMac 
promoters (described in United States Patent No. 5,106,739 and by Gomai et al. Plant Mol 

20 Biol 15:373-381 (1990)) and other transcription initiation regions from various plant genes 
known to those of skill in the art. Such genes include for example, ACTll from Arabidopsis . 
(Huang et ai. Plant Mol Biol 33:125-139 (1996)), Cat3 from Arabidopsis (GenBank No. 
U43147, Zhong et al, Mol Gen, Genet. 251:196-203 (1996)), the gene encoding stearoyl- 
acyl carrier protein desaturase from Brassica napns (Genbank No. X74782, Solocombe et 

25 al. Plant Physiol 104:1 167-1 176 (1994)), GPcl from maize (GenBank No. X15596, 
Martinez et al, J. Mol Biol 208:551-565 (1989)), and Gpc2 from maize (GenBank No. 
U45855, Manjunath et al. Plant Mol Biol 33:97-1 12 (1997)). Useful promoters for plants 
also include those obtained from Ti- or Ri-plasmids, from plant cells, plant viruses or other 
hosts where the promoters are found to be functional in plants. Bacterial promoters that 

30 function in plants, and thus are suitable for use in the methods of the invention include the 
octopine synthetase promoter, the nopaline synthase promoter, and the manopine synthetase 
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promoter. Suitable endogenous plant promoters include the ribulose-l,6-biphosphate 
(RUBP) carboxylase small subunit (ssu) promoter, the (a-conglycinin promoter, the 
phaseolin promoter, the ADH promoter, and heat-shock promoters. 

Promoters for use in £. coli include the T7, trp, or lambda promoters, a 
ribosome binding site and preferably a transcription termination signal. For eukaryotic cells, 
the control sequences typically include a promoter which optionally includes an enhancer 
derived from immunoglobulin genes, SV40, cytomegalovirus, etc, and a polyadenylation 
sequence, and may include splice donor and acceptor sequences. In yeast, convenient 
promoters include GALl-10 (Johnson and Davies (1984) Mol. Cell Biol 4:1440-1448) 
ADH2 (Russell et al (1983) J. Biol Chem, 258:2674-2682), PH05 (EMBO J. (1982) 6:675- 
680), and MFa (Herskowitz and Oshima (1982) in The Molecular Biology of the Yeast 
Saccharomyces (eds. Strathem, Jones, and Broach) Cold Spring Harbor Lab., Cold Spring 
Harbor, N.Y., pp. 181-209). 

Alternatively, one can use a promoter that directs expression of a gene of 
interest in a specific tissue or is otherwise under more precise environmental or 
developmental control. Such promoters are referred to here as "inducible*' or '*repressible" 
promoters. Examples of environmental conditions that may effect transcription by inducible 
promoters include pathogen attack, anaerobic conditions, ethylene or the presence of light. 
Promoters under developmental control include promoters that initiate transcription only in 
certain tissues, such as leaves, roots, fruit, seeds, or flowers. The operation of a promoter 
may also vary depending on its location in the genome. Thus, an inducible promoter may 
become fully or partially constitutive in certain locations. Inducible promoters are often used 
to control expression of the recombinase gene, thus allowing one to control the timing of the 
recombination reaction. Examples of tissue-specific plant promoters under developmental 
control mclude promoters that initiate transcription only in certain tissues, such as fruit, 
seeds, or flowers. The tissue-specific E8 promoter from tomato is particularly useful for 
directing gene expression so that a desired gene product is located in fiiiits. See, e.g.. Lincohi 
et al (1988) Proc, Natl Acad ScL USA 84: 2793-2797; Deikman et al (1988) EMBOJ, 7: 
3315-3320; Deikmane/ a/. (1992) P/aw/P/iy5io/. 100:2013-2017. Other suitable promoters 
include those from genes encoding embryonic storage proteins. Examples of environmental 
conditions that may affect transcription by inducible promoters include anaerobic conditions. 
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elevated temperature, or the presence of light. Additional organ-specific, tissue-specific 
and/or inducible foreign promoters are also known {see, e,g.^ references cited in Kuhlemeier 
et al {19^1) Ann. Rev. Plant Physiol. 38:221), including those 1,5-ribulose bisphosphate 
carboxylase small subimit genes of Arabidopsis thaliana (the "ssu" promoter), which are 

5 light-inducible and active only in photosynthetic tissue, anther-specific promoters (EP 

344029), and seed-specific promoters of, for example, Arabidopsis thaliana (Krebbers etal. 
(1988) Plant Physiol 87:859). Exemplary green tissue-specific promoters include the maize 
phosphoenol pyruvate carboxylase (PEPC) promoter, small submit ribulose bis-carboxylase 
promoters (ssRUBISCO) and the chlorophyll a/b binding protein promoters. The promoter 

10 may also be a pith-specific promoter, such as the promoter isolated fix»m a plant TrpA gene 
as described in International Publication No. W093/07278. 

Inducible promoters for other organisms include, for example, the arabinose 
promoter, the lacZ promoter, the metallothionein promoter, and the heat shock promoter, as 
well as many others that are known to those of skill in the art. An example of a repressible 

1 5 promoter usefiil in yeasts such as S. pombe is the Pmnt promoter, which is repressible by 
vitamin Bl. 

Typically, constructs to be introduced into these cells are prepared using 
recombinant expression techniques. Recombinant expression techniques involve the 
construction of recombinant nucleic acids and the expression of genes in transfected cells. 

20 Molecular cloning techniques to achieve these ends are known in the art. A wide variety of 
cloning and in vitro amplification methods suitable for the construction of recombinant 
nucleic acids are well-known to persons of skill. Examples of these techniques and 
instructions sufficient to direct persons of skill through many cloning exercises are found in 
Berger and Kammel, Guide to Molecular Cloning Techniques, Methods in Enzymology, 

25 Volume 152, Academic Press, Inc., San Diego, CA (Berger); and Current Protocols in 
Molecular Biology, P.M. Ausubel et al, eds., Current Protocols ^ a joint venture between 
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1998 Supplement) 
(Ausubel). 

The construction of polynucleotide constructs generally requires the use of 
30 vectors able to replicate in bacteria. A plethora of kits are commercially available for the 
purification of plasmids from bacteria. For their proper use, follow the manufacturer's 
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instructions {see, for example, EasyPrepJ, FlexiPrepJ, both from Pharmacia Biotech; 
StrataCleanJ, from Stratagene; and, QIAexpress Expression System, Qiagen). The isolated 
and purified plasmids can then be further manipulated to produce other plasmids, used to 
transfect cells or incorporated into Agrobacterium tumefaciens to infect and transform 
plants. Where Agrobacterium is the means of transfonnation, shuttle vectors are constructed. 
Cloning in Streptomyces or Bacillus is also possible. 

Selectable markers are often incorporated into the polynucleotide constructs 
and/or into the vectors that are used to mtroduce the constructs into the target cells. These 
markers permit the selection of colonies of cells containing the polynucleotide of interest. 
Often, the vector will have one selectable marker that is functional in, eg., £. coli, or other 
cells in which the vector is replicated prior to being introduced into the target cell. Examples 
of selectable markers for E, coli include: genes specifying resistance to antibiotics, i.e., 
ampicilUn, tetracycline, kanamycin, erythromycin, or genes conferring other types of 
selectable en2ymattc activities such as p-galactosidase, or the lactose operon. Suitable 
selectable markers for use in mammalian cells include/for example, the dihydrofolate 
reductase gene (DHFR), the thymidine kinase gene (TK), or prokaryotic genes conferring 
drug resistance, gpt (xanthine-guanine phosphoribosyltransferase, which can be selected for 
with mycophenohc acid; neo (neomycin phosphotransferase), which can be selected for with 
G41 8, hygromycin, or puromycin; and DHFR (dihydrofolate reductase), which can be 
selected for with methotrexate (Mulligan & Berg (1981) Proc. Natl Acad. Scu USA 78: 
2072; Southern & Berg (1982) /. MoL Appl Genet. 1 : 327). 

Selection markers for plant cells often confer resistance to a biocide or an 
antibiotic, such as, for example, kanamycin, G 418, bleomycin, hygromycm, or 
chioramphenicol, or herbicide resistance, such as resistance to chlorsulfuron or Basta. 
Examples of suitable coding sequences for selectable markers are: the neo gene which codes 
for the enzyme neomycin phosphotransferase which confers resistance to the antibiotic 
kanamycin (Beck et al (1982) Gene 19:327); the hyg (hpt) gene, which codes for the enzyme 
hygromycin phosphotransferase and confers resistance to the antibiotic hygromycin (Gritz 
and Davies (1983) Gene 25:179); and the bar gene (EP 242236) that codes for 
phosphinothricin acetyl transferase which confers resistance to the herbicidal compounds 
phosphinothricin and bialaphos. 
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If more than one exogenous nucleic acid is to be introduced into a target 
eukaryotic cell, it is generally desirable to use a different selectable marker on each 
exogenous nucleic acid. This allows one to simultaneously select for cells that contain both 
of the desired exogenous nucleic acids. 

5 Methods for Introducing Constructs into Target Cells 

The polynucleotide constmcts that include recombination sites and/or 
recombinase-encoding genes can be introduced into the target cells and/or organisms by any 
of the several means known to those of skill in the art. For instance, the DNA constructs can 
be introduced into plant cells, either in culture or in the organs of a plant by a variety of 

10 conventional techniques. For example, the DNA constructs can be introduced directly to 
plant cells using biolistic methods, such as DNA particle bombardment, or the DNA 
construct can be introduced using techniques such as electroporation and microinjection of 
plant cell protoplasts. Particle-mediated transformation techniques (also known as 
*n)iohstics") are described in Klein et al. Nature, 327:70-73 (1987); Vasil, V. et al, 

15 Bio/Technol 11:1553-1558 (1993); andBecker,D. a/., P/an/ J., 5:299-307(199^^^ These 
methods involve penetration of cells by small particles with the nucleic acid either within the 
matrix of small beads or particles, or on the surface. The biohstic PDS-1000 Gene Gun 
(Biorad, Hercules, CA) uses heUum pressure to accelerate DNA-coated gold or tungsten 
microcarriers toward target cells. The process is ^plicable to a wide range of tissues and 

20 cells from organisms, including plants, bacteria, fungi, algae, intact animal tissues, tissue 
culture cells, and animal embryos. One can employ electronic pulse delivery, which is 
essentially a mild electroporation format for live tissues in animals and patients. Zhao, 
Advanced Drug Delivery Reviews 17:257-262 (1 995). 

Other transformation methods are also known to those of skill in the art. 

25 Microinjection techniques are known in the art and well described in the scientific and patent 
Uterature. The introduction of DNA constmcts using polyethylene glycol (PEG) precipitation 
is described in Paszkowski et al, EMBOJ, 3:2717 (1984). Electroporation techniques are 
described in Fromm et al. Proc, NatL Acad, Sci, USA, 82:5824 (1985). PEG-mediated 
transformation and electroporation of plant protoplasts are also discussed in Lazzeri, P., 

30 Methods Mol Biol 49:95-106 (1995). Methods are known for introduction and expression of 
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heterologous genes in both monocot and dicot plants. See, e.g., US Patent Nos. 5,633,446, 
5,317,096, 5,689,052, 5,159,135, and 5,679,558; Weising et al (1988) Ann, Rev, Genet 
22:421-477. Transformation of monocots in particular can use various techniques including 
electroporation (e.g., Shimamoto et al.. Nature (1992), 338:274-276); biolistics (e.g.. 
European Patent Application 270,356); and Agrobacterium (e.g., Bytebier et al, Proc. Natl 
Acad Set USA (1987) 84:5345-5349). 

For transformatioii of plants, DNA constructs may be combined vnth suitable 
T-DNA flanking regions and mtroduced into a conventional Agrobacterium tumefaciens host 
vector. The virulence functions of the A. tumefaciens host will direct the insertion of a 
transgene and adjacent maricer gene(s) (if present) into the plant cell DNA when the cell is 
infected by the bacteria. Agrdbacterium tumefaciens-m&di\zitA transformation techniques 
are well described in the scientific Uterature. See, for example, Horsch et al Science^ 
233:496-498 (1984), Fraley et al, Proc, Natl Acad, Sci. USA, 80:4803 (1983), and 
Hooykaas, Plant Mol Biol, 13:327-336 (1989), Bechtold et al, Comptes Rendus De L 
Academic Des Sciences Serie lii-Sciences De La Vie-Life Sciences, 316:1194-1199 (1993), 
Valvekens et al, Proc, Natl Acad, ScL USA, 85:5536-5540 (1988). For a review of gene 
transfer methods for plant and cell cultures, 5ee, Fisk et al, Scientia Horticulturae 55:5-36 
(1993) and Potrykus, CJBA Found, Symp. 154:198 (1990). 

Other methods for dehvery of polynucleotide sequences into cells include, for 
example liposome-based gene dehvery (Debs and Zhu (1993) WO 93/24640; Mannino and 
Gould-Fogerite (1988) BioTechniques 6(7): 682-691 ; Rose U.S. Pat No. 5,279,833; Brigham 
(1991) WO 91/06309; and Feigner et al (1987) Proc, Natl Acad, ScL USA 84: 7413-7414), 
as well as use of viral vectors (eg., adenoviral {see, e.g,, Bems et al (1995) Ann, NY Acad, 
Sci, 772: 95-104; Ah et al (1994) Gene Ther, 1: 367-384; and Haddada et al (1995) Curr, 
Top, Microbiol Immunol 199 ( Pt 3): 297-306 for review), papillomaviral, retroviral (^ee, 
e.g., Buchscher et al (1992) J, Virol 66(5) 2731-2739; Johann et al (1992) J, Virol 66 
(5):1635-1640 (1992); Sommerfelt etal, (1990) Virol 176:58-59; Wilson et al (1989) J. 
Virol 63:2374-2378; Miller et al., 1 Virol 65:2220-2224 (1991); Wong-Staal et al, 
PCT/US94/05700, and Rosenburg and Fauci (1993) in Fundamental Immunology, Third 
Edition Paul (ed) Raven Press, Ltd., New York and the references therein, and Yu et al. 
Gene Therapy (1994) supra,), and adeno-associated viral vectors {see. West et al (1987) 
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Virology 160:38-47; Carter aL (1989) U.S. Patent No. 4,797,368; Carter et al WO 
93/24641 (1993); Kotin (1994) Human Gene Therapy 5:793-801; Muzyczka (1994) 1 Clin, 
InvsL 94:1351 and Samulski (supra) for an overview of AAV vectors; see also, Lebkowski, 
U.S. Pat. No. 5,173,414; Tratschin et al (1985) MoL Cell Biol 5(1 1):3251-3260; Tratschin 
et al (1984) Mol Cell Biol, 4:2072-2081; Hermbnat and Muzyczka (1984) Proc, Natl 
Acad, Scl USA, 8 1 :6466-6470; McLaughlin et al (1988) and Samulski et al (1 989) J. 
Virol, 63:03822-3828), and the like. 

Methods by which one can analyze the integration pattern of the introduced 
exogenous DNA are well known to those of skill in the art. For example, one can extract 
DNA from the transformed cells, digest the DNA with one or more restriction enzymes, and 
hybridize to a labeled fragment of the polynucleotide consmict. The inserted sequence can 
also be identified using the polymerase chain reaction (PCR). See, e.g., Sambrook et al. 
Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989 for descriptions of these and other suitable methods. 

Regeneration of Transgenic Plants and Animals 

The methods of the invention are particidarly iisefiil for obtaiiiing traiisgenic 

and chimeric multicellular organisms that have a stably integrated exogenous polynucleotide 
or other stable rearrangement of cellular nucleic acids. Methods for obtaining transgenic and 
chimeric organisms, both plants and animals, are well known to those of skill in the art. 

Transformed plant cells, derived by any of the above transformation 
techniques, can be cultured to regenerate a whole plant which possesses the transformed 
genotype and thus the desired phenotype. Such regeneration techniques rely on 
manipulation of certain phytohormones in a tissue culture growth medium, typically relying 
on a biocide and/or herbicide marker which has been introduced together with the desired 
nucleotide sequences. Plant regeneration from cultured protoplasts is described in Evans et 
al. Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp. 124-176, 
Macmillian Publishing Company, New York (1983); and in Binding, Regeneration of 
Plants, Plant Protoplasts, pp. 21-73, CRC Press, Boca Raton, (1985). Regeneration can also 
be obtained from plant callus, explants, somatic embryos (Dandekar et al, J. Tissue Cult, 
Metk, 12:145 (1989); McGranahan et al. Plant Cell Rep, 8:512 (1990)), organs, or parts 
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thereof. Such regeneration techniques are described generally in Klee et alj Ann. Rev. of 
Plant Phys., 38:467^86 (1987). 

The methods are useful for producing transgenic and chimeric animals of 
most vertebrate species. Such species include, but are not limited to, nonhuman mammals, 

5 including rodents such as mice and rats, rabbits, ovines such as sheep and goats, porcines 
such as pigs, and bovines such as cattle and buffalo. Methods of obtaining transgenic 
animals are described in, for example, Puhler, A., Ed., Genetic Engineering of Animals^ 
VCH Publ., 1993; Murphy and Carter, Eds., Transgenesis Techniques : Principles and 
Protocols (Methods in Molecular Biology, Vol. 18), 1993; and Pinkert, CA, Ed., Transgenic 

10 Animal Technology : A Laboratory Handbook, Academic Press, 1994. Transgenic fish 
having specific genetic modifications can also be made using the claimed methods. See, 
e,g, Iyengar et al (1996) Transgenic Res, 5: 147-166 for general methods of making 
transgenic fish. 

One method of obtainihg a transgenic or chimeric animal having specific 

1 5 modifications in its genome is to contact fertilized oocytes with a vector that includes the 
polynucleotide of interest flanked by recombination sites. For some animals, such as mice 
fertilization is performed in vivo and fertilized ova are surgically removed In other animals, 
particularly bovines, it is preferably to remove ova firom live or slaughterhouse animals and 
fertilize the ova in vitro. See DeBoer et al, WO 91/08216. In vf/ro fertilization permits the 

20 modifications to be introduced into substantially synchronous cells. Fertilized oocytes are 
then cultured in vitro imtil a pre-implantation embryo is obtained containing about 16-150 
cells. The 16-32 cell stage of an embryo is described as a morula. Pre-implantation 
embryos containing more than 32 cells are termed blastocysts. These embryos ^how the 
development of a blastocoel cavity, typically at the 64 cell stage. If desired, the presence of 

25 a desired exogenous polynucleotide in the embryo cells can be detected by methods known 
to tiiose of skill in the art. Methods for culturing fertilized oocytes to the pre-implantation 
stage are described by Gordon et al (1984) Methods Enzymol 101: 414; Hogan et al 
Manipulation of the Mouse Embryo: A Laboratory Manual, C.S.H.L. N.Y. (1986) (mouse 
embryo); Hammer et al (1985) Nature 315: 680 (rabbit and porcine embryos); Gandolfi et 

30 al (1987)7. Reprod. Pert, 81: 23-28; Rexroad a/. (1988) J. Anim, Sci, 66: 947-953 (ovine 
embryos) and Eyestone et al (1989) 1 Reprod. Pert, 85: 715-720; Camous et al (1984) 1 
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Reprod. Fert. 72: 779-785; and Heyman et al (1987) Theriogenology 27: 5968 (bovine 
embryos). Sometimes pre-implantation embryos are stored frozen for a period pending 
implantation. Pre-implantation embryos are transferred to an appropriate female resulting in 
the birth of a transgenic or chimeric animal depending upon the stage of development when 
5 the transgene is integrated. Chimeric mammals can be bred to form true germline transgenic 
animals. 

Alternatively, the methods can be used to obtain embryonic stem cells (ES) 
that have a single copy of the desired exogenous polynucleotide. These cells are obtained 
from preimplantation embryos cultured in vitro. See, e.g.. Hooper, ML^ Embryonal Stem 

10 Cells : Introducing Planned Changes into the Animal Germline (Modem Genetics, v. 1), 
Int'l. Pub. Distrib., Inc., 1993; Bradley et al. (1984) Nature 309, 255-258. Transforaied ES 
cells are combined with blastocysts from a non-human animal. The ES cells colonize the 
embryo and in some embryos form the germ line of the resulting chimeric animal. See 
Jaenisch, Science, 240: 1468-1474 (1988). Alternatively, ES cells or somatic cells that can 

1 5 reconstitute an organism ("somatic repopulating cells**) can be used as a source of nuclei for 
transplantation into an enucleated fertilized oocyte giving rise to a transgenic mammal. See, 
e.g., Wihnut et al (1997) Nature 385: 810-813. 

EXAMPLES 

The following examples are oflFered to illustrate, but not to limit the present 

20 invention. 

Example 1 

The <DC31 Recombination Svstem Functions in Schizosaccharomvces vombe 

This Example demonstrates that the Streptomyces bacteriophage <1>C3 1 site- 
specific recombination system functions in eukaryotic cells. A bacteriophage attachment site 
25 {attP) was introduced into a chromosome of Schizosaccharomyces pombe at the S, pombe 
leu] locus. This target strain was subsequently transformed with a plasmid that contains the 
bacterial attachment site (attB) linked to a ura4* selectable marker. When co-transformed 
with a second plasmid harboring the <1>C31 integrase gene, high efiBciency transforaiation to 
Ura"^ was observed under conditions where the integtase gene was expressed. 
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Southern analysis of the integration events shows insertion of the attB'Ura4^ 
plasmid into the attP site of the leul locus. Nucleotide sequence of the hybrid junctions 
revealed that the attB x attP recombination reaction is precise. 

Materials and Methods 

5 Recombinant DNA 

Standard methods were used throughout. £. co/r strain XL2-Blue {recAl 
endAl gyrA96 thi-1 hsdRl 7 supE44 relAl lac [F* proAB lacB ZtsMISTnlO (TetO Amy 
Cam^, Strategene) served as host for DNA constructs. 

Media 

10 Fission yeast strains were grown on minimal medium (EMM-low glucose, 

from BiolOl) supplemented as needed with 225 mg/1 adenine, histidine, leucine or uracil. 
Minimal plates with 5-FOA (5-floroorotic acid, from Zymo Research, Inc.) were prepared 
according to Grimm et al ((1988) Mol Gen. Genet 215: 81-86) and were supplemented 
with adenine, histidine, and leucine. When used, thiamine was added to 5 jig/ml. 

15 S. pombe with <K31 attP target 

The 84 bp OC3 1 a^rP site (abbreviated as PP'), isolated as an Apal-Sacl 
fragment from pHS282 (Thorpe & Smith (1998) Proc. Natl Acad. ScL USA 95:5505-5510) 
was cloned into the same sites of the S. pombe integrating vector pJK148 (Keeney & Boeke 
(1994) Genetics 136:849-856) to make pLT44: This plasmid was targeted to the S. pombe 

20 leul-32 allele by lithium acetate mediated transformation with Ndel cut DNA. The recipient 
host FY527 {h' ade6-M216 his3-Dl leul-32 ura4'D18), converted to Leu* by homologous 
recombination with pLT44, was examined by Southern analysis. One Leu* transformant, 
designated FY527attP, was found to contain a single copy of pLT44. Another transformant, 
designated FY527attPx2, harbors a tandem plasmid insertion. 

25 Integrative ura4* vector with <PC31 attB site 

The S. pombe ura4^ gene, excised from pTZura4 (S. Forsburg) on a 1.8 kb 
EcoKL-BamHI fragment, was inserted into pJK148 cut with the same enzymes to create 
pLT40. The <I>C31 attB site (abbreviated as BB'), isolated from pHS21 as a 500 bp BamBI' 
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Xbal fragment, was ligated into pLT40 cut with those enzymes, creating pLT42. Most of the 
leul gene was removed from pLT42 by deleting aXhoI fragment to create pLT45. This 
removed all but 229 bp of leul from pLT45 and reduced its transformation efficiency to that 
of a plasmid without any leul homology. pLT50, which has a second attB site in the same 
5 orientation immediately on the other side of ura4^ was constructed by first subcloning the 
attB BamHl'Sacl fragment from pLT42 into pUC19, excising it with EcoKl and SalR, and 
subsequently inserting it into pLT45 cut with EcoRl and Xhol. The second attB site in the 
final construct was sequenced once on each strand and found to be identical to the first attB 
site. 

10 Linear DNA transformation 

The attB'ura4^'attB linear DNA was prepared as an AttJI-AlwNl firagment 
purified from pLT50, or as a PGR product using pLT50 as template. PGR was conducted 
using standard conditions with a T3 primer and a second primer (5* ggc cct gaa att gtt get tct 
gcc 3') corresponding to the plasmid backbone of pJK148. 

15 Repressible synthesis of <X31 integrase 

The S, pombe Pmnt promoter, repressible by vitamin B 1, was excised as a 1 .2 
kb Pstl'Sacl firagment from pM0147 and inserted into the his3^y arsl vector pBG2 (Ohi et 
al (1996) Gene 174: 315-318) cut with the same enzymes, creating pLT41, A 2.0 kb Sad 
firagment containing the OG3 1 int coding region was transferred from pHS33 (Thorpe & 

20 Smith (1998) supra) to the Sad site of pLT41 . A clone in which the int coding region is 
oriented such that expression is under the control of Pmnt was designated pLT43 . 

Molecular analyses 

Southern analysis was perfonned using the Genius™ system from Boehringer 
Mannheim. A 998 bp mtemal EcoRV fragment of leul ^ a 1.8 kb fragment of ura4 , and the 
25 2.0 kb OG3 1 int gene were digoxigen-Iabeled by the random primer method and used as 
probes. Polymerase chain reaction was performed on a Perkin Ehner Getus Gene Amp PGR 
9600 using Stratagene Turbo PFU enzyme or VENT polymerase. The standard T3 and T7 
primers were used where possible. The «ra^ primer (5' gtc aaa aag ttt cgt caa tat cac 3* 
(SEQ ID NO: 1)) and the pJK148 primers were purchased from Operon Technologies. For 
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all PCR reactions an annealing temperature of 51°C and a 30-second extension time were 
used. 

Results and Discussion 

Inserting a target site into the S. pombe genome 

5 To create a host strain with a target site for OCSl -mediated integration, the 

OC31 attP site was inserted by homologous recombination into the leu\ locus of the fission 
yeast genome to form the Leu"*" strain FY527attP (Figure 1 A). Previous studies showed that 
when S. pombe DNA is cleaved with Xbal and probed with an internal 1 kb Augment of the 
leu\^ gene, the probe detects a 14 kb band (Keeney & Boeke (1994) Genetics 136: 849-856). 

10 Insertion of the^eu"^ plasmid pJK148 at the leul-32 locus results in detection of 3 and 18 kb 
bands (Figure lA). Since pLT44 differs fi-om pJK148 by the inclusion of an 84 bp Q>C3l 
att? element, integration of pLT44 at /ewl-32 yielded the same 3 kb and 18 kb hybridization 
pattern in FY527a//P. The absence of other hybridizing fragments mdicates that the pLT44 
DNA resides as a single integrated copy. 

1 5 <PC31-integase-mediated transformation 

FY527atttP was transformed with pLT45, which harbors ura/' and an attB 
sequence (BB*) but lacks an origin of replication. This construct was introduced by itself or 
with pLT43, a his 3^ repUcating vector that produces OC3 1 integrase. The inclusion of 
pLT43 increased the number of Ura^ transformants an average of 15 fold (Table 1). This 

20 enhancement caimot be attributed to the recombination between pLT45 and the repUcation- 
proficient pLT43, as its effect is dependent on integrase gene expression. Transcription of 
the integrase gene is under the control of Pmnt^ a promoter repressible by high levels of 
vitamin Bl (Maundrell, K. (1993) Gene 123: 127-130). The repression is not absolute 
(Forsburg, S. L. (1993) Nucleic Acids Res 21: 2955-2956) but reduces the production of 

25 integrase protein. When thiamine was added to the growth medium, the number of Ura* 
transformants decreased to near background level. The frequency of Ura^ tranformants did 
not change significantly whether or not the integrase plasmid was co-selected by omission of 
histidine &om the medium. The transformation competency of FY527atttP was estimated 
fit)m the number of His^ transformants obtained with pLT43 or its progenitor plasmid pBG2. 
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Compared to the frequency of either replicating plasmid, the pLT43-dependent 
transformation of FY527attP averaged about 15%. 

Table 1 : Integrase-dependent site-specij5c insertion in S. pombe FY527attP. 

DNA Selection Bl Transformants Relative Class a Class b Others 
(l^g) (S^ig) per lO' cells Value' 
(±sd)* 

pLT43 ffis* - 7200 (±2200) 100 

pLT45 ura* " 63 (±10) 1 0%^ 100%* 

pLT45+ - 1100 (±120) 15 ggo// go/^t go/^t 
PLT43 

pLT45+ ura* + 120(^16) 2 o%* 25%* 75%* 
pLT43 


•From three independent experiments 

'(transformation efficiency of the DNA of interest)/(transformation efBciency of pLT43) x 
100 
10 ^n=16 
*n=8 

^ 

{PCSl'integrase promoted attP jc attB recombination 

Recombination between the pLT45-encoded <DC3 1 attB element and the 
chromosomally situated atff sequence would incorporate the circular DNA into the leul 

15 locus as depicted in Figure IB. If this reaction occurs, A^^al-firactionated genomic DNA 

from the Ura^ transformants is probed with leul DNA, tiie 3 kb band will remain unchanged, 
while the 18 kb band will increase to -23 kb (Figure IC), Randomly selected Ura^ colonies 
were examined by hybridization analysis. Of eight isolates derived from experiments where 
<tC3l integrase gene expression was derepressed by the omission of thiamine, seven showed 

20 the presence of this -23 kb band. This same size band hybridized to the ura4 probe. This 
contrasts with the lack of ura4 hybridization with the parental strain, as expected from its 
ttrfl4-D18 deletion allele. One of these seven isolates showed additional bands hybridizing 
to both probes. This candidate appears to have a DNA rearrangement at the leul locus in 
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addition to a site-specific recombination event. The leul rearrangement was probably 
catalyzed by the operative S. pombe homologous recombination system. The remaining 
isolate had not experienced a site-specific recombination event and appeared to have gained 
uracil prototrophy by recombination between pLT45 and pLT43. Of these eight isolates, 

5 half were selected as both Ura^ and His*^, but no significant difference was found between 
this group and the group selected for Ura^ only. 

From transformation experiments plated in the presence of vitamin B 1 , an 
equal number of Ura^ transformants was examined by DNA hybridization. The thiamine- 
repressible Pnmt promoter is expected to limit integrase production, and thereby site-specific 

10 integration. Two of the eight Ura'*' candidates isolated firom this low fi*equency 

transformation showed a band of 23 kb hybridizing to leul and to the ura4 probe. However, 
since both probes detected an additional band, they do not represent correct integration 
events, and we grouped them as class b integrants. In the other six isolates, the hybridization 
patterns are difBcuU to interpret. In some of them, the 3 kb band was not detected by the 

1 5 leul probe, as though the locus has experienced some rearrangenaent. In many of them, the 
weak hybridization to ura4 suggests that the Ura^ phenotype may not be due to the stable 
maiiitenance of pLT45 in the genome. 

To ascertain the proportion of transformants maintaining the integrase 
plasmid in the absence of selection, the blots were re-probed 'with the integrase gene 

20 sequence. Those selected as Ura'^His'^ would be expected to maintain the plasmid, and did 
so, as the hybridization revealed. Five of the eight isolates selected as Ura^ without regard to 
the His phenotype also gave biands hybridizmg to the integrase probe. To confirm that loss 
of would not affect stable integration, another set of randomly chosen Ura^ cells were 
grown non-selectively for a number of generations and screened for His" progeny that have 

25 lost pLT43. The analysis of eight representative Ura* His' clones showed that all had a 
single copy of pLT45 precisely integrated at the chromosome-situated attP site. The DNA 
of these integrants did not hybridize with the integrase probe. In contrast, the background 
fi-equency Ura**^ clones derived by transformation of pLT45 alone gave the parental 
configuration of hybridizing bands at the leul locus and additional faint bands at 5 kb and 7 

30 kb. These observations are consistent with either integration of pLT45 elsewhere in the 
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genome, or maintenance of the plasmid in some cells despite the lack of a 5. pombe 
replication origin. 

) 

Conservative site-specific recombination 

PCR was used to retrieve the attPlattB recombinant junctions from three 

5 representative Ura^ candidates. One of the hybrid sites, attR (PB*) would be flariked by T3 
and T7 promoters; the other site, attL (BP*) by the T3 promoter and ura4 DNA (Figure IC). 
In each case, primer pairs directed to these sequences amplified a band of the expected size 
while the original auP (PP') was no longer found. This contrasts with the parental strain 
FY527attP, where attP, but neither attL nor attR, was detected. The nucleotide sequence of 

10 three representative attL and attR PCR products showed the absence of accompanying 
mutations. Hence, as in bacteria and mammalian cells, <1>C31 mediated site-specific 
recombination in S. pombe is a conservative recombination reaction. 

^31 integrase does not excise integrated molecules 

Thorpe and Smith ((1998) Proc. Natl Acad. Sci. USA 95: 5505-5510) did not 

1 5 detect reversal of the <I>C3 1 integrase reaction by analysis of gel-fractionated DNA 

fiagments. We examined the possibility of a reverse reaction through a genetic selection 
strategy. The precise integration of pLT45 into EY527attP was confirmed for three clones 
by Southern analysis; these strains were then re-transformed with pLT43. Excision of 
pLT45 would result in loss of the ura4^ maricer; the Ura" phenotype can be scored on plates 

20 with 5-FOA (Grimm et al (1988) Mol Gen, Genet, 215: 81-86). The frequencies of Ura' 
segregants from cultures of the three Ura^ His' progenitors were 5.7 x 10"^, 7.1 x 10"^ and 5.6 
X 10"*. In contrast, the firequencies of Ura" colonies from the three Ura^ His* derivatives were 
somewhat higher: 1 .1 x 10■^ 3.8 x 10'^ and 2.3 x 10'^ respective 19-, 5- and 4-fold increases. 
When a control vector lacking the integrase gene, pBG2, was used instead, increased rates 

25 of 5-FOA resistance were also found: 1.0 x 10'^, 1.0 x 10'^ and 8.0 x 10*\ respectively. The 
transformation process itself appears mutagenic. 

Three Ura* His* clones from each of the three cultures that had been 
transformed by pLT45 were analyzeji by Southern blotting. One isolate had a DNA pattern 
consistent with stable integration of pLT45 into FY527attP. Therefore, in this clone, the 

30 Ura' phenotype was caused by a mutation that did not appreciably alter the restriction 
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pattern, rather than by reversal of the site-specific recombination reaction. The second clone 
showed a Southern pattern characteristic of FY527attP lacking a pLT45 insertion, the third 
had a pattern consistent with a mixttire of two types of cells, those like FY527attP without a 
pLT45 insertion, and those hke the FY527attP progenitor strain FY527. The latter structure 
could arise firom intrachromosomal homologous recombination between the leu repeats, 
reversing the insertion of pLT44 (Figure 1^). If precise excision of the integrated plasmid 
DNA occurred in the latter two candidates, the attP site would be regenerated; this would be 
detectable with PGR. The size of the PGR product was that expected for an intact hybrid 
site, the presence of the hybrid site was confirmed by sequencing the PGR product. These 
observations are consistent with the idea that deletion of the ura4 gene occurred by some 
mechanism other than OG31 -mediated excision. 

Summary 

The integration of a circular molecule at a single target site was an efficient 
process yielding precise insertions in nearly all transformants. The few aberrant events we 
1 5 observed are probably largely attributable to the S. pombe recombination system acting on 
the leul repetitive DNA. When integrase production was limited through the repression of 
its promoter, the number of transformants was reduced to near background level. Under 
these conditions, few of the recovered transformants were derived firom <t)G31 site-specific 
recombination. Functional operation of the OGSl site-specific recombination system in 
20 eukaryotic cells presents new opportunities for the manipulation of transgenes and 

chromosomes. The OG31 system can be used with selective placement of attB and attP sites 
to delete, invert or insert DNA. An important feature of this system is that the attB x attP 
reaction is irreversible in the absence of an excision-specific protein. 

Example 2 

25 The OCai Integrase Functions in CHO Cells to Create Stable Integration 

This Example describes an experiment in which the (DG3 1 integrase was 
tested for ability to mediate recombination between attB and attP recombination sites in 
Chinese hamster ovary (CHO) cells. 


5 
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Methods 

The CHO cell line 51YT21 1 was transfected with the aZ/P-containing plasmid 
pFYl, which included a selectable marker that confers zeocin resistance (Figure 2). After 
being single colony pxirified twice, six zeocin resistant cell lines were isolated. Analysis by 

5 Southern DNA hybridization confirmed that each of the six cell lines had at least one 
molecule of pFYl integrated into the genome. 

Each of the six cell lines was transfected with the a//5-containing plasmid 
pFY9 and the znr-containing plasmid pFY6 to test for site-specific recombination between 
the attB sites on pFY9 and the attP site on the chromosomal copy of pFY'l. As control, the 

10 same cell lines were transfected with pFY9, but without the mr-containing pFY6. The pFY9 
plasmid included a neomycin resistance selectable marker under the control of an SV40 
early promoter, as well as a green fluorescent protein (GFP) coding sequence that is not 
linked to a promoter. Site-specific recombination would thus be expected to place the GFP 
coding sequence under the control of a human cytomegalovirus promoter that was included 

15 in pFYl, resulting in expression of GFP. 

Results 

Transfection results: Neomycin resistant colonies were placed under the 
microscope to observe whether the GFP gene is active. A large percentage of the cells 
transfected with pFY9+pFY6, but only a few of the cells transfected with the pFY9 alone 
20 showed GFP activity. This is consistent with site-specific integration of pFY9 when co- 
transfected with pFY6, and random insertion of pFY9 in the absence of a co-transfected int 
gene. 

PGR analysis was conducted using a primer set that corresponds to Pc 
(human cytomegalovirus promoter) and GFP (Figure 2). These primers would be expected 

25 to amplify a band of --0.6 kb corresponding to the integration junction. As neomycin 

resistant colonies could arise from both site-specific integration and random integration, and 
that the GFP marker does not confer a selectable trait, it was difficult to obtain pure cultures 
of integrant clones. Therefore, pools of neomycin resistant cells from each transfected line 
were subjected to PGR analysis to examine if the integration junction were present among 

30 the neomycin resistant cells. A band of the expected size of -^.6 kb was obtained from two 
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lines. This indicates that the attB x attP recombination junction has fonned linking Pc with 
GFP. 

Example 3 

a)C31 Integrase Catalyzes Site-Specific Recombination in CHO Cells 
5 This Example describes a second experiment in which the <I>C3 1 integrase 

was tested for ability to integrate a DNA molecule into the chromosome of Chinese hamster 
ovary (CHO) cells through the recombination between attP and attB sites. 

Methods 

Plasmid constructs 

10 Chromosomal attB target constructs pFY12, pFY14 and pFYlS 

The plasmid pcDNA3.1/His/lacZ (Invitrogen) was used as a vector backbone. 
A synthetic oligonucleotide contained different length of the attB site, flanked by HindSH 
and Kpnl sites, was inserted between the if indlH (AAGCTT) and Kpnl (GGTACC) sites of 
pcDNA3.1/ffis/lacZ. 

15 The plasmid pFY12 contains 90 bp of the attB sequence (AAGCTT 

gacggtctcg aagccgcggt gcgggtgcca gggcgtgccc ttgggctccc cgggcgcgta ctccacctca cccatctggt 

ccatcatgat GGTACC) (SEQ ID NO: 2). 

The plasmid pFY14 contained 50 bp of the attB site (AAGCTT gcgggtgcca 

gggcgtgccc ttgggctccc cgggcgcgta ctccacctca TGGTACC) (SEQ ID NO: 3). 
20 The plasmid pFY15 contained 30 bp of attB (AAGCTT ccagggcgtg 

cccttgggct ccccgggcgc ATGGTACC) (SEQ ID NO: 4). 

Integratmg attP plasmids pFyi7, pFY19. pFY20 

The hpt gene encoding for resistance to hygromycin, obtained as a 1 .6 kb 

Bamm to Kpnl fragment from pED113, was inserted between the BamBl and Kpnl sites of 
25 pBluescript n SK to generate the control plasmid pBSK-hpt. 

A synthetic oligonucleotide containing different lengths of the attP site was 

inserted between Sad (GTCGAC) and Bamm (GGATCC) sites in pBSK-hpt to generate the 

following plasmids: 
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a) TtiP plagmid pFYM contains 90 bp (GAGCTC-g aagcggttt tcgggagtag- 
tgccccaact ggggtaacct ttgagttctc tcagttgggg gcgtagggtc gccgacatga cacaaggggt-GGATCC) of 

flf/Psite(SEQIDNO: 5). 

b) T^^ placmiH pFYI Q contains 5Q bp of attP site fGAGCTC-t gccccaact 

5 ggggtaacct ttgagttctc tcagttgggg gcgtagggtc-GGATCQ (SEQ ID NO: 6). 

c) The plasmidpF^'20 contains 32 bp of a//P site (GAGCTC-actggggtaa 

cctttgagtt ctctcagtt g ggATCC) (SEQ ID NO: 7) is called pFY20. 
Tnteerase expressing construct pFY6 

An EcoRl to BaniHl fragment containing the nearly complete open reading 
10 frame of the integrase gene was inserted between the jEcaRI and BaniHl sites of 

pcDNA3.1/Zeo(-) (Invitrogen), A synthetic oligonucleotide (GGGCCCGCCACGATGACA 
CAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTC^ 
CGCGAGCGCGAGAATTC) (SEQ ID NO: 8) containing a Kozack sequence and the N- 
terminal amino acid coding sequences of the integrase gene was subsequently inserted 
1 5 between the Apal and EcoRI sites to reconstruct the open reading frame. This orientation 
places a complete integrase coding region under the control of the CMV (human 
cytomegalovirus) promoter in pcDNA3.1/Zeo(-). 

Transfection protocol 

The CHO cell line K-1 was transfected with attB target constructs pFY12, 
20 pFY14 or pFY15 (Figure 3). These plasmids harbor the selectable marker for neomycin 
resistance, and an attB site of various lengths located between Pc (human cytomegalovirus 
promoter) and the lacZ coding region. Plasmids pFY12, pFY14 and pFY15 contain, 
respectively, 90, 50 and 30 bp of the attB sequence. Neomycin-resistant cell lines were 
obtained from consecutive purification of single colonies. Four lines of each construct were 
25 used for integration experiments. 

Each of the 12 lines was transfected with pFY6, a 0031 integrase expression 
plasmid, along with an integration vector, pFY17, pFY19, or pFY20. The plasmids pFY17, 
pFY19 and pFY20 harbor an attP sequence of lengths 90, 50 and 32 bp, respectively. The 
attP sequence is situated upstream of the hpt open reading frame, which encodes 
30 hygromycin phosphotransferase, an enzyme that confers resistance to hygromycin. There is 
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no promoter upstream of the attP-hpt segment and hpt is therefore not expressed unless the 
plasmid integrates into the genome in such a way that the hpt coding region fuses with a 
genomic promoter. For control, pBSK-hpt was used to monitor the frequency of promoter 
fusion to hpL The plasmid pBSK-hpt is identical to pFY17, pFY19, and pFY20 except it 
lacks an attP sequence. The recombination between attP and attB sites is expected to insert 
the integration vector into the chromosome target to generate a Pc-attL-hpt linkage. 
Expression of hpt will confer resistance to hygromycin. 

Results 

Transfection results: Hygromycin resistant colonies were scored for each 
integration plasmid which was transfected into the 12 cell lines (Table 2). From 1x10^ cells 
plated, pBSK-hpt transfections failed to produce a significant number of resistant colonies. 
This indicates that the frequency of the hpt coding region fiising to a genomic promoter is 
extremely low. In contrast, pFY17, pFY19 and pFY20 yielded up to a thousand fold higher 
number of hygromycin resistant colonies, depending on the particular integration plasmid 
and the particular cell line. Higher numbers of hygromycin resistant colonies were produced 
from the transfection of pFY19 or pFYl 7 into FY12 lines. This indicates that the 
recombmation between longer attB and attP sequences is more efiBcient than the 
recombination between shorter attB and attP sites. 

PGR was used to detect the expected -0.8 Kb junction band from 
representative colonies. Primers; corresponding to the human cytomegalovirus promoter and 
the hpt coding region amplified a PGR product of the expected size (0.8 kb). This indicates 
that Pc is linked to the hpt coding region, consistent with recombination between the 
genomic attB site and the plasmid attP sequence. 

Example 4 

The <DC31 Integrase Functions in Plant Chromosomes to Recombine a ttP and attB Sites 

This example describes an experiment m which the Q>Ci\ mtegrase was 
tested for abihty to recombine attP and attB sites that are present in a plant chromosome. 
The constructs and strategy for this experiment are shown in Figure 4. 
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Table 2 

Number of hygromycin resistant colonies per 1x10^ transfected cells. 



Integration plasmids 

Target cell lines 

pFY17 

pFY19 

pFY20 

pBbK-iq)t 


(90 bp AttP) 

(50 bp AttP) 

(32 bp AttP) 

/—KM hMTI 

(no attr site) 

(attB-90) 





FY12-1 

256 

856 

275 

0 

E^12-2 

976 

896 

185 

0 

FY12-3 

246 

976 

114 

2 

FY12-4 

964 

896 

327 

0 

(attB-50) 





FY14-3 

23 

240 

45 

0 

FY14-7 

96 

245 

67 

0 

FY14-8 

21 

135 

49 

2 

FY14-9 

89 

255 

78 

1 

(attB-30) 




0 

FY15-1 

0 

24 

0 

FY15-2 

0 

345 

34 

0 

FY15-3 

55 

455 

23 

1 

FY15-4 

0 

0 

0 

0 


Methods 

The construct pWP29 contains &e fragment consisting of SSS-attP-npt-attB- 
gus, flanked by RB and LB, where 35S is the cauliflower mosaic virus promoter, npt is the 
coding region for neomycin phosphotransferase, and gtds is the coding region for 
glucuronidase. RB and LB are the right and left Agrobacterium T-DNA border sequences, 
respectively. The attP site between 35S and npt serves as a i^on-translated leader sequence. 
Transcription of npr by 35 S confers resistance to kanamycin. The gus coding region is not 
transcribed due to the lack of an upstream promoter. 

A second construct used for plant transformation is pWP24. This construct 
contains the fragment Pnos-npt'SSS-int, flanked by RB and LB, where Pnos is the nopaline 
synthase promoter, and int is the OC31 integrase coding region. Both npt and int are 
transcribed from their respective upstream promoters. 
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If the two constructs were present in the same genome, the expression of in/ 
from the pWP24 bearing chromosome would be expected to produce fimctional OC31 
integrase to catalyze the recombination between attB and attP sites situated on the pWP29- 
beaiing chromosome. The recombination event would be expected to delete the npt gene 
5 from the pWP29 construct and fuse 35S to gus. The resulting configuration would be 358- 
attR'gus^ where attR is a hybrid site formed by the recombination between attP and attB^ 
also designated as PB* (Figure 4). The deletion oinpt brings gus xmder the transcription of 
35S and would be expected to yield plants with GUS enzyme activity. This activity can be 
detected through histochemical staining of the plant tissue. 

10 Results 

A transient expression assay was conducted to determine whether pWP29 
was functional for recombination. Through the biolistics-mediated delivery of naked DNA, 
pWP29 was cointroduced with pWP8 into maize BMS cells. The construct pWP8 has the 
integrase gene fused behind the niaize ubiquitin promoter for expression in monocot cells. 

1 5 Blue spots were observed when both plasmids were co-introduced, but were not found if 
only one of the plasmids was used. This indicated that site-specific recombination took 
place in maize cells and that the attP and attB sites in pWP29 were functional sites. 

Kanamycin resistant tobacco plants were regenerated by Agrobacterium- 
mediated transformation using pWP29 or pWP24. Another transient expression assay was 

20 conducted to determine whether the pWP24 lines produced functional integrase. The 

construct pWP29 was mtroduced into the pWP24 plants through biolistics mediated delivery 
of naked DNA. Cells that take up the pWP29 DNA would be expected to express GUS 
enzyme activity as a result of the formation of a 35S-attR'gus configuration. Indeed, two 
lines, 24.3 and 24.4 yielded blue spots consistent of functional integrase-mediated site- 

25 specific recombination between the attP and attB sites. 

These two pWP24 integrase lines were crossed to pWP29 tester lines to 
produce progeny with the chromosomes carrying pWP29 and pWP24 in the same genome. 
Table 3 sxunmarizes the results from the genetic crosses between integrase (24.3, 24.4) and 
tester lines (29.2, 29.4,29.5, 29.19). In each case, representative progeny seedlings were 

30 germinated in the absence of selection and histochemically stained for GUS enzyme activity. 

36 


wo 01/07572 


PCTAJSOO/19983 


The table lists the number of progeny that stained blue. As the primary transfonned pWP24 
and pWP29 lines are hemizygous for their respective transgene, only a quarter of the 
progeny would be expected to carry both transgene types. The sample sizes were small, so 
an apparent deviation from the expected frequency is not unusual. 

Table 3: Progeny that showed gus expression from histochemical staining. 


Male Donor plant Female Recipient 

Number of 

Number of 

% positive for 

line 

plant line 

progeny stained 

progeny that 

gus activity 



for gus activity 

show gus 





activity. 


24.3+ 

29.2 

38 

11 

29% 

29.2 

24.3+ 

38 

1 

2.6% 


29.4 

24.3+ 

18 

3 

16% 


24.3+ 

29.5 

38 

4 

10% 

29.5 

24.3+ 

26 

0 

0 


24.4 

29.2 

38 

7 

19% 

29.2 

24.4+ 

38 

7 

19% 


29.4 

24.4+ 

19 

8 

42% 


24.4+ 

29.5 

38 

17 

45% 

29.5 

24.4+ 

20 

6 

30% 


29.19 

24.4+ 

18 

7 

39% 


The intensity of staining varied depending on the combination of lines used as 
10 parental lines. Those with progeny with a greater proportion of the tissue staining blue 

indicate that the recombination event was more eflBcient. Conversely, those yielding progeny 
with less uniform staining indicate that the recombination event was less eflBcient. This 
variation among the different progeny pools is probably due to effects caused by the position 
of integration of the transgenes. Of the two integrase lines, 24.4 appears more eflBcient in 
1 5 promoting site-specific recombination. This is probably due to a higher level of int gene 

expression. Staining patterns produced by crossing 24,4 to 29.4 and 29.19 are consistent with 
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the experimental design that int promoted site-specific recombination of attB and attP results 
in the activation of gus gene activity. 

Example 5 

<I>C31 Integrase Catalyzes Integration of a Circular Plasmid into a Plant Chromosome 

5 This example describes an experiment in which the OC31 integrase was 

tested for ability to insert a circular plasmid molecule into the plant chromosome through 
attP X attB site-specific recombination. This experiment is diagrammed in Figure 5. 

Methods 

The target constmct pWP6 contains the firagment consisting of 3 SS-attP-npU 
10 flanked by RB and LB, The site between 355 and np/ serves as a non-transk^^ 
sequence. Transcription of npt by 35S confers resistance to kanamycin. 

The integrating construct pYJC43 has the fiagment attB-hpt, where hpt codes 
for resistance to hygromycin. The integrase expression construct is pYJC41, in which 35S 
transcribes int. 

1 5 The target constmct pWP6 was placed into a plant chromosome through 

random integration of pWP6 DNA. Kanamycin resistant plants harboring a single copy of 
the pWP6 transgene are then subsequently transformed with pYJC43 and pYJC41. The 
transient expression of int firom pYJC41 was expected to catalyze the recombination 
between the attB site of pYJC43 and the chromosomally-situated attP site of the pWP6 

20 transgene. The specific recombination between attB znd attP sites would insert the pYJ C43 ^ 
circular molecule into the chromosome to generate a 35S-attL-hpt linkage. Note that 
because the attP and attB sites are depicted in the inverted orientation, the attL site will 
likewise be m an mverted orientation, or designated P*B, the same as BP' in the drawn in an 
inverted orientation. A fimctional 35S'attL'hpt hnkage would confer a hygromycin 

25 resistance phenotype. 

Results 

Kanamycin resistant tobacco plants harboring pWP6 were obtained through 
Agrobacterium-mediated transformation. Southern hybridization analysis detected one line 
that harbors a single copy of the pWP6 transgene. Progeny from this line, WP6.1, were 
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genninated aseptically and protoplasts were made from these plants. The protoplasts were 
transformed by the combination of pYJC43 and pYJC41 DNA by the polyethylene glycol 
method for direct DNA transformation. The protoplasts were then imbedded into agarose 
and cultured to form calli in the presence of hygromycin. The rate of callxis formation in the 
absence of hygromycin selection was 4 x 10-4. jhis is about 10 fold lower than usual, but is 
within the range of variability observed in protoplast transfonnation experiments. In the 
presence of hygromycin selection, the rate of callus formation was 7 x 10-5. xhis indicates 
that about 18% of the calli that regenerated from protoplasts contained the integration vector 
at the target site. When the integrase construct pYJC41 was excluded from the 
transformation, the rate of callus formation was <1 x 10'^. The higher frequency of 
hygromycin resistant calli produced by inclusion of the integrase expressing plasmid 
pYJC41 is consistent with the integrase promoted site-specific integration of pYJC43 into 
the chromosomal a«P target. 

It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in Ught thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of 
this application and scope of the appended claims. All publications, patents, and patent 
applications cited herein are hereby incorporated by reference for all purposes. 
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WHAT IS g^AlMED IS ; 

1 1 . A eukaryotic cell that comprises a prokaryotic recombinase polypeptide 

2 or a nucleic acid that encodes a prokaryotic recombinase, wherein the recombinase can 

3 mediate site-specific recombination between a first recombination site and a second 

4 recombination site that can sen^e as a substrate for recombination with the first 

5 recombination site, but in the absence of an additional factor that is not present in the 

6 eukaryotic cell cannot mediate recombination between two hybrid recombinase 

7 recombination sites that are formed upon recombination between the first recombination site 

8 and the second recombination site. 

1 2. The eukaryotic cell of claim 1 , wherein the recombinase is selected 

2 from the group consisting of a bacteriophage OC3 1 integrase, a coliphage P4 recombinase, a 

3 Listeria phage recombinase, a bacteriophage R4 Sre recombinase, a CisA recombinase, an 

4 XisF recombinase, and a transposon Tn4451 TnpX recombinase. 

1 3. The eukafyotic cell of claim 1 , wherein the recombinase is a 

) 

2 bacteriophage OC31 mtegrase. 

1 4. The eukaryotic cell of claim 1, wherein the first recombination site is an 

2 attB site and the second recombination site is an attP site. 

1 5 . The eukaryotic cell of claim 1 , wherein the cell fiirther comprises a first 

2 recombinase recombination site. 

1 6. The eukaryotic cell of claim 1 , wherem the cell comprises a nucleic acid 

2 that comprises a coding sequence for an recombinase polypeptide, which coding sequence is 

3 operably linked to a promoter that mediates expression of the recombinase-encoding 

4 polynucleotide in the eukaryotic cell. 

1 7. The eukaryotic cell of claim 6, wherein the nucleic acid fiirther 

2 comprises a selectable marker. 
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1 8. The eukaryotic cell of claim 6, wherein the promoter is an inducible or a 

2 repressible promoter. 

1 9. The eiikaryotic cell of claim 8, wherein the nucleic acid is the plasmid 

2 pLT43. 

1 10. The eukaryotic cell of claim 1 , wherein the eukaryotic cell is selected 

2 from the group consisting of an animal cell, a plant cell, a yeast cell, an insect cell and a 

3 fungal cell. 

1 11, The eukaryotic cell of claim 10, wherein the eukaryotic cell is a 

2 mammalian cell. 

1 12. The exikaryotic cell of claim 10, wherein the eukaryotic cell is present in 

2 a multicellular organism. 

1 13. A method for obtaining site-specific recombination in a eukaryotic ceU, 

2 the method comprising: 

3 providing a eukaryotic cell that comprises a first recombination site and 

4 a second recombination site, which second recombination site can serve as a substrate for 

5 recombination with the first recombination site; 

6 contacting the first and the second recombination sites with a 

7 prokaryotic recombinase polypeptide, resulting in recombination between the recombination 

8 sites, thereby forming one or two hybrid recombination sites; 

9 wherein the recombinase polypeptide can mediate site-specific 

10 recombination between the first and second recombination sites, but cannot mediate 

1 1 recombination between two hybrid recombination sites in the absence of an additional factor 

12 that is not present in the eukaryotic cell. 

1 14. The method of claim 13, wherein the eukaryotic cell is selected from the 

2 group consisting of a yeast cell, a fungal cell, a plant cell, an insect cell and an animal cell. 
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1 15. The method of claim 13, wherein the first recombination site is present 

2 in a chromosome of the eukaryotic cell. 

1 16. The method of claim 15, wherein the second recombination site is 

2 present in a second chromosome of the eukaryotic cell and contacting the first and second 

3 recombination sites with the recombinase results in translocation of chromosome aims. 

1 17. The method of claim 13, wherein the first recombination site and the 

2 second recombination site are present on a single nucleic acid molecule. 

1 18. The method ofclaim 17, wherein the first recombination site and the 

2 second recombination site are in a direct orientation. 


1 19. The method ofclaim 18, wherein the recombination results in excision 

2 of the portion of the nucleic acid molecule that Ues between the first and second 

3 recombination sites. 

1 20. The method ofclaim 17, wherein the first recombination site and the 

2 second recombination site are in an inverted orientation. 

1 21. The method of claim 20, wherein the recombination results in inversion 

2 of the portion of the nucleic acid molecule that Kes between the first and second 

3 recombination sites. 

1 22. The method ofclaim 13, wherein the eukaryotic cell comprises a 

2 polynucleotide that encodes the recombinase polypeptide. 

1 .23 . The method of claim 22, wherein the recombinase-encoding 

2 polynucleotide is operably linked to a promoter which mediates expression of the 

3 polynucleotide in the eukaryotic cell. 
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1 24. The method of claim 23, wherein the promoter is an inducible or a 

2 repressible promoter. 

1 25 . The method of claim 24, wherein the promoter is a Pmnt promoter. 

1 26, A method for obtaining a eukaryotic cell having a stably integrated 

2 transgene, the method comprismg: 

3 introducing a nucleic acid into a eukaryotic cell that comprises a first 

4 recombination site, wherein the nucleic acid comprises a transgene and a second 

5 recombination site which can serve as a substrate for recombination with the first 

6 recombination site; and 

7 contacting the first and the second recombination sites with a 


8 prokaryotic recombinase polypeptide, wherein the recombinase polypeptide catalyzes 

9 recombination between the first and second recombination sites, resulting in integration of 

10 the nucleic acid at the first recombination site, thereby forming a hybrid recombination site 

11 at each end of the nucleic acid; 


12 wherein the recombinase polypeptide can mediate site-specific 

13 recombination between the first and second recombination sites, but cannot mediate 

14 recombination between two hybrid recombination sites in the absence of an additional factor 

15 that is not present in the eukaryotic cell. 

1 27. The method of claim 26, wherein the recombinase polypeptide is 

2 ^elected fsom the group consistmg of a bacteriophage <I>C3 1 integrase, a coliphage P4 

3 recombinase, a Listeria phage recombinase, a bacteriophage R4 Sre recombinase, a CisA 

4 recombinase, an XisF recombinase, and a transposon ln4451 TnpX recombinase. 

1 28. The method of claim 27, wherein the recombinase is a OC3 1 integrase. 

1 29. The method of claim 26, wherein the recombinase polypeptide is 

2 introducedintotheeukaryoticcellby expression of a polynucleotide that encodes the 

3 recombinase polypeptide. 
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30. The method of claim 29, wherein the polynucleotide that encodes the 
recombinase polypeptide is operably linked to a promoter that functions in the eukaryotic 
cell. 

3 1 . The method of claim 30, wherein the promoter is an inducible or a 
repressible promoter. 

32. A nucleic acid that comprises a polynucleotide sequence that encodes a 
bacterial recombinase polypeptide operably linked to a promoter that functions in a 
eukaryotic cell, wherein the recombinase polypeptide cannot mediate recombination between 
two hybrid recombination sites that are formed upon recombination between a first 
recombination site and a second recombination site in the absence of an additional factor. 

33. The nucleic acid of claim 32, wherein the nucleic acid further comprises 
at least one recombination site that is recognized by the recombinase polypeptide. 

34. The nucleic acid of claim 32, wherein the nucleic acid comprises a 
plasmid vector. 

35. The nucleic acid of claim 34, wherein the vector is pLT43. 

36. A eukaryotic cell that comprises a polynucleotide that comprises a first 
2 bacteriophage <t^C3 1 recombination site. 

1 37. The eukaryotic cell of claim 36, wherein the recombination site is 

2 selected fi-om the group consisting of attP and attB, 

1 38. The eukaryotic cell of claim 36, wherein the eukaryotic cell further 

2 comprises a second polynucleotide that comprises a second <3>C3 1 recombination site that 

3 undergoes recombination with the first 003 1 recombination site when contacted with a 

4 OC3 1 integrase polypeptide. 
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1 39. The eukaryotic cell ofclaim 38, wherein: 

2 the first recombination site is attB and the second recombination site is 

3 attP; or 

4 the first recombination site is attP and the second recombination site is 

5 attB, 

1 40. The eukaryotic cell of claim 38, wherein the second polynucleotide 

2 fiirther comprises a transgene. 

1 41. The eukaryotic cell ofclaim 38, wherein the second polynucleotide 

2 fiirther comprises a selectable marker. 

1 42. The eukaryotic cell ofclaim 36, wherein the eukaryotic cell fiirther 

2 comprises a <I>C31integrase polypeptide. 

1 43. The eukaryotic cell of claim 36, wherein the eukaryotic cell fiirther 

2 comprises a nucleic acid that comprises a polynucleotide that encodes a <I>C3 1 mtegrase 

3 polypeptide. 

1 44. The eukaryotic cell ofclaim 43, wherein the nucleic acid fiirther 

2 comprises a selectable maricer. 

1 45. The eukaryotic cell ofclaim 43, wherein the nucleic acid fiirther 

2 comprises a promoter which results in expression of the a>C31 integrase-encoding 

3 polynucleotide in the cell. 

1 46. The eukaryotic cell of claim 45, wherein the promoter is an inducible 

2 promoter. 

1 47. The eukaryotic cell of claim 36, wherein the eukaryotic cell is selected 

2 from the group consisting of a yeast ceU, a fimgal cell, a plant cell, and an animal ceU. 
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Figure 1 
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F^re 2 

Transgene Integration in CHO Cell Line 
Site Specific Expression of GFP 



BB* 

GFP 

Ps 

neo 


%r/j^ PC 


int i&SS!{ Ps 


ze SSfiS^ 


Ps 

pp. 

Ps 

neo 


pFY 


0.6 kb 


Int 


Pc 

PB* 

GFP 

Ps 

neo 


BP' 

Ps 

zee 



-• 







Pc = human cytomegalovirus promoter 


= attP 

Ps = S V40 early promoter 

1 bbM 

= attB 

zeo = zeomycin resistance coding region 

Fpy] 

= attR 

neo = neomycin resistance coding region 

[bF] 

= attL 


2/5 

SUBSTITUTE SHEET (RULE 26) 


wo 01/07572 PCr/USDO/19983 

Figure 3 

Transgene Integration in CHO Cell Line 
Hygromycin Resistance from attB x attP recombination 
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Excision of DNA from Tobacco Genome 
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Figure 5 

Integration of DNA into the tobacco genome 
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